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During the years 2000 and 2001 the HERA machine and the HI experiment performed substantial luminosity 
upgrades. To cope with the increased demands on data handling an effort was made to redesign and modernize 
the analysis software. Main goals were to lower turn-around time for physics analysis by providing a single 
framework for data storage, event selection, physics and event display. The new object oriented analysis envi- 
ronment based on the RooT framework provides a data access front-end for the new data storage scheme and 
a new event display. The analysis data is stored in four different layers of separate files. Each layer represents 
a different level of abstraction, i.e. reconstruction output, physics particles, event summary information and 
user specific information. Links between the layers allow correlating quantities of different layers. Currently, 
this framework is used for data analyses of the previous collected data and for standard data production of the 
currently collected data. 
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1. Introduction 

HI is an experiment at the electron proton collider 
HERA at DESY (Germany), which started collecting 
data in 1992. Essential parts of the analysis software 
architecture as well as the basic data model were es- 
tablished ten years ago. Like most of the high energy 
physics experiments at that time HI used FORTRAN 
based packages such as BOS [l| and FPACK for data 
storage and access. Physics analysis were performed 
on so-called n-tuples using HBOOK and PAW 0. 
After the HERA and HI upgrade in the year 2000, 
the increased luminosity put new demands on data 
storage and data handling. Therefore, the HI Collab- 
oration has decided to move towards a new technology 
and to develop an analysis environment which should: 

• lower turn around time for physics analysis 

• provide a unique, modern, extendable and re- 
usable framework 

• incorporate and support all HI physics analyses 

• standardize the physics algorithms, e.g. kine- 
matic reconstruction, selection criteria, particle 
identifications etc. 

• provide one unique code reference and therefore 
facilitate exchange of information between dif- 
ferent analysis groups 

• make expert knowledge reusable by non-experts 
and lower the threshold of starting a new anal- 
ysis 

• provide a faster, more efficient access to the data 

• make doing analyses in HI more attractive to 
students 

To cope with these requirements, the HI Collab- 
oration chose to base its analysis software on the 
object oriented analysis framework RooT RooT 
is based on C++ and provides software for all aspects 



related to physics analysis (i.e. processing, storing 
and visualizing data). In addition a C-l — h interpreter 
allows interactive use of the framework within the 
same scripting language. 



2. The data storage model 

In order to standardize physics analysis and to take 
full advantage of the new partial event reading capa- 
bility of the RooT framework a four layer data struc- 
ture has been implemented. 

• ODS (Object Data Store): This layer is a 
1-1 copy of the reconstruction output formerly 
called 'Data Summary Tape' (DST). It contains 
reconstructed tracks, clusters as well as impor- 
tant detector informations. Even though this 
layer can be stored persistently, the standard is 
to produce it transiently from DST. This way 
people doing analysis in the old and those using 
the new framework can work in parallel with- 
out having to store the same information twice. 
The size of ODS objects is of the order of 13 
kb/ event. 

• //ODS (^Object Data Store): On //ODS 4- 
vectors of physical particles and particle candi- 
dates are stored (see fig. [TJ. The sum of all 
particle candidates provides a 4 II coverage of 
the solid angle around the reconstructed vertex 
and there is no double counting of energy. Each 
of the particles stores also a pointer to the ODS- 
tracks and clusters that it was build from. 
The identified, physical particles contain all par- 
ticle information and some specific detector in- 
formations that was used to identify the particle 
or that might be necessary for further specifi- 
cation during physics analysis. Composed par- 
ticles (as for example jets or J/Psi particles) 
are stored in a special class containing pointers 
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to the related particle candidates or identified 
particles. With the provided information, the 
/iODS is largely sufficient for most analysis pur- 
poses. A mean amount of 3 kb/event has been 
achieved. 

• HAT: The HAT ('HI Analysis Tag') contains 
event summary level information, such as event 
kinematics and particle multiplicities. All vari- 
ables are stored as basic types. The main pur- 
pose it to allow a fast event selection ( "tagging" ) 
for each of the physics analysis. The size is 0.4 
kb/event. 

• User-Tree: To persistenly store user specific in- 
formation that is not already stored on official 
data layers a so-called "user-tree" is supported 
by the framework. 

The ODS layer is filled transiently using DST in- 
formation only. The /iODS is filled by the physics 
algorithm detailed in the section New /iODS and 
HAT are centrally produced whenever new analysis 
algorithms or calibration results are available. 

3. The data access 

The data access is implemented in a set of skeleton 
classes which were developed by following the three 
main requirements: 

• the user has only one single event loop (synchro- 
nization of the different layers is transparent to 
the user) 

• a transparent access to the different levels of 
the data: 'Smart' accessor functions allow to re- 
trieve information about event and particle at- 
tributes across boundaries of different files, e.g. 
AtODS-to-ODS. 

• the access to the data is partial, e.g. accessing a 
cluster on ODS from /iODS should not require 
to read the full ODS event. 

These requirements are implemented in the class 
'HITree' and some helper classes by using different 
RooT trees in parallel, one for each storage level. One 
of these helper classes is 'HIEventList' which is based 
on the RooT TEventList and facilitates the access to 
events according to user selection. HIEventList allows 
to cumulate different selections and to select data on 
each layer. 

4. The physics algorithm 

As HI is a running experiment with an increas- 
ing flow of new data, it is essential that the quality 



and precision of these physics analysis be sustained. 
Therefore the first goal was, while learning from the 
already existing algorithms in FORTRAN, to develop 
and implement algorithms in the new framework with 
better performances than the old ones. To ensure 
quality and extendibility of the new analysis software, 
a modular organization of loosely physics algorithms 
is essential. The aim is to allow for routines developed 
in particular user analyses to be integrated in the of- 
ficial production code and in addition to facilitate the 
physics analysis and to lower the turn-around time for 
beginners. Modularity and portability is a prerequi- 
site for the goal that the best knowledge of all physics 
working groups in HI be propagated into one common 
framework. The interface between the filling code and 
the physics algorithms is structured such that the ad- 
dition of new algorithms involves minimal changes to 
the software. Technically the algorithms are imple- 
mented in separate classes that obey the same inter- 
face. The running is divided into two steps: 

• First, the particle finders reconstruct the identi- 
fied particles and the particle candidates using 
ODS objects only. 

• In a second step, the composed particle identi- 
fiers run on the already reconstructed particles. 

The first category of finders comprises an electron, 
muon, hadron and photon finder. They are based on 
the already existing algorithms in fortran and show 
the same performances in terms of quality and preci- 
sion. In the second category of finders, a jet finder 
using a Kt algorithm as well as a J/ 1 ?, a D* and 
Ko have been implemented and validated. In future, 
new particle finder could cascly be integrated in the 
new scheme and the existings ones are continuously 
be improved. 



5. The event display 

A new event display has been developed. It is an 
application based on the new physics analysis frame- 
work and thus allows for the direct dialogue between 
the analysis part (e.g. event selection and histogram- 
ming) and visual inspection of events. The display 
was originally derived from the Alice 3D RooT display. 
Thanks to the RooT Run Time Information (RTTI), 
objects on the screen can be picked and inspected, 
thus accessing the physics information. Graphics slid- 
ers can be used to apply selections such that only rel- 
evant objects for a certain analysis are displayed. A 
new feature is the possibility to display the particle 
4-vectors stored on /zODS on top of the detector ob- 
jects. For instance, one could display the 4-vectors of 
particles (different particle types are displayed in dif- 
ferent colours) on top of the detector information and 
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Figure 1: Structure of the /iODS level with pointers between the identified and composed particles. 



require minimum transverse momentum for all parti- 
cles by moving a graphic slider. One advantage of the 
new display is his full backward compatibility to the 
old 2D command-line base program based on LOOK 
[f| . Existing code containing expert knowledge about 
detector details is reused, thus fully integrating the 
functionality of the previous display. A new parser 
for the 'LOOK'-macro language was written in CH — h 
It is possible to display event information stored in 
RooT files as well as information stored in the former 
DST format. Raw information, like for example the 
hit information of a reconstructed track, could there- 
fore easely be retrieved. The new program combines 
modern features, such as the GUI, the click and in- 
spect options and the 3D-display with the advantages 
of the old display. 



6. Summary 

A new analysis framework based on object oriented 
programming techniques has been succesfully intro- 
duced. The key to this success was the clear definition 
of the scope of the project: 

• A code development group takes care of the 
technical challenges, such as encapsulation of 



the data handling. The physics working groups 
develop algorithms and add their code via well 
defined interfaces. The main reconstruction al- 
gorithms have been implemented, tested and 
validated. 

• The end users obtain a nice and easy-to-use 
product integrating all analysis specific tools 
into one single framework. 
The physics analysis all greatly profit from the new 
and enhanced analysis environment. The framework 
is widely accepted within the Collaboration: almost 
all of the new starting analyses are based on the frame- 
work. It is in addition used for the official data quality 
checks. 
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